





MANAGING MEGABYTES 





A Better Way to 
Compress Images 


Mathematics is providing a novel technique for achieving 
compression ratios of 10,000 to 1—and higher 


Michael F. Barnsley and Alan D. Sloan 


THE NATURAL WORLD is filled with 
intricate detail. Consider the geometry 
on the back of your hand: the pores, the 
fine lines, and the color variations. A 
Camera can capture that detail and, at 
your leisure, you can study the photo to 
see things you never noticed before. Can 
personal computers be made to carry out 
similar functions of image storage and 
analysis? If so, then image compression 
will certainly play a central role. 

The reason is that digitized images— 
images converted into bits for processing 
by a computer—demand large amounts of 
computer memory. For example, a high- 
detail gray-scale aerial photograph might 
be blown up to a 344-foot square and then 
resolved to 300 by 300 pixels per square 
inch with 8 significant bits per pixel. 
Digitization at this level requires 130 
megabytes of computer memory—too 
much for personal computers to handle. 

For real-world images such as the aeri- 
al photo, current compression techniques 
can achieve ratios of between 2 to 1 and 
10 to 1. By these methods, our photo 
would still require between 65 and 13 
megabytes. 

In this article, we describe some of the 
main ideas behind a new method for 
image compression using fractals. The 
method has yielded compression ratios in 
excess of 10,000 to 1 (bringing our aerial 
photo down to a manageable 13,000 
bytes). The color pictures in figures 1 
through 5 were encoded using the new 
technique; actual storage requirements 
for these images range from 100 to 2000 
bytes. 


A mathematics research team at the 


Georgia Institute of Technology is devel- 
oping the system, with funding provided 
by the Defense Advanced Research Proj- 
ects Agency (DARPA) and the Georgia 
Tech Research Corporation (GTRC). Our 
description is necessarily simplified, but 
it will show you how a fractal image- 
compression scheme operates and how to 
use it to create exciting images. 


Describing Natural Objects 
Traditional computer graphics encodes 
images in terms of simple geometrical 
shapes: points, line segments, boxes, cir- 
cles, and so on. More advanced systems 
use three-dimensional elements, such as 
spheres and cubes, and add color and 
shading to the description. 

Graphics systems founded on tradi- 
tional geometry are great for creating pic- 
tures of man-made objects, such as 
bricks, wheels, roads, buildings, and 
cogs. However, they don’t work well at 
all when the problem is to encode a sun- 
set, a tree, a lump of mud, or the intricate 
structure of a black spleenwort fern. 
Think about using a standard graphics 
system to encode a digitized picture of a 
cloud: You’d have to tell the computer the 
address and color attribute of each point 
in the cloud. But that’s exactly what an 
uncompressed digitized image is—a long 
list of addresses and attributes. 

To escape this difficulty, we need a 
richer library of geometrical shapes. 
These shapes need to be flexible and con- 
trollable so that they can be made to con- 
form to clouds, mosses, feathers, leaves, 
and faces, not to mention waving sun- 
flowers and glaring arctic wolves. Fractal 


geometry provides just such a collection 
of shapes. For a hint of this, glance at the 
pictures in The Fractal Geometry of Na- 
ture by Benoit Mandelbrot, who coined 
the term fractal to describe objects that 
are very “fractured” (see references for 
additional books and articles). Some ele- 
mentary fractal images accompany this 
article. 

Using fractals to simulate landscapes 
and other natural effects is not new; it has 
been a primary practical application. For 
instance, through experimentation, you 
find that a certain fractal generates a pat- 
tern similar to tree bark. Later, when you 
want to render a tree, you put the tree- 
bark fractal to work. 

What is new is the ability to start with 
an actual image and find the fractals that 
will imitate it to any desired degree of ac- 
curacy. Since our method includes a com- 
pact way of representing these fractals, 
we end up with a highly compressed data 
set for reconstructing the original image. 


Overview of Fractal Compression 
We start with a digitized image. Using 
image-processing techniques such as 
color separation, edge detection, spec- 
trum analysis, and texture-variation 
analysis, we break up the image into seg- 
ments. (Some of the same techniques 
continued 
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Figure 1: [FS-encoded color image of 
three-dimensional ferns (4 
transformations, 100 bytes). 





Figure 2: [FS-encoded color photo of 
Black Forest, color set adjusted to give 
winter tones (120 transformations, 
2000 bytes). 





Figure 3: [FS-encoded color photo of a 
Bolivian girl (120 transformations, 2000 
bytes). 








form the basis for the automatic coloring 
of black-and-white motion pictures.) A 
segment might be a fern, a leaf, a cloud, 
or a fence post. A segment can also be a 
more complex collection of pixels: A sea- 
scape, for example, may include spray, 
rock, and mist. 

We then look up these segments in a li- 
brary of fractals. The library doesn’t 
contain literal fractals; that would require 
astronomical amounts of storage. In- 
stead, our library contains relatively 
compact sets of numbers, called iterated 
function system (IFS) codes, that will re- 
produce the corresponding fractals. Fur- 
thermore, the library’s cataloging system 
is such that images that look alike are 
close together: Nearby codes correspond 
to nearby fractals. This makes it feasible 
to set up automated procedures for 
searching the library to find fractals that 
approximate a given target image. A 
mathematical result known as the Collage 
Theorem (more on that later) guarantees 
that we can always find a suitable IFS 
code—and gives a method for doing so. 

Once we have looked up all the seg- 
ments in our library and found their IFS 
codes, we can throw away the original 
digitized image and keep the codes, 
achieving our compression ratio of 
10,000 to 1—or even higher. 


Iterated Function Systems 
We start by explaining how a set of IFS 
codes can approximate a natural image. 
IFS theory is an extension of classical 
geometry. It uses affine transformations, 
explained below, to express relations be- 
tween parts of an image. Using only these 
relations, it defines and conveys intricate 
pictures. With IFS theory, we can de- 
scribe a cloud as clearly as an architect 
can describe a house. 
By studying the following sections, 





Figure 4: IFS-encoded color photo of 
the Monterey coast (60 transformations, 
100 bytes). 





you should be able to encode and decode 
fascinating black-and-white image seg- 
ments, such as leaf skeletons, tree 
shadows, spirals, and thunderheads. You 
should also obtain an overview of how a 
fully automated fractal compression sys- 
tem operates. 

Affine transformations can be de- 
scribed as combinations of rotations, 
scalings, and translations of the coordi- 
nate axes in n-dimensional space. An ex- 
ample in two dimensions is 


W(x,y) = (xt My+1, Mxt4y+2), 


which can also be written in matrix form 
as 


aoneeal an} 


This transformation moves the point (0,0) 
to (1,2) and moves (—1,0.5) to (0.625, 
2). To confirm your understanding of the 
idea, you should work out where it moves 
the point (1,1). We denote this transfor- 
mation by W; the notation W(S) denotes 
the subimage of Won a set of points S. 

Now let’s see what W does to a picture 
of a smiling face, F, lying on the x,y plane 
(see figure 6). The result is a new, 
squeezed face W(F). The affine transfor- 
mation has deformed and moved the face. 
Notice that the eyes in the transformed 
face W(F) are closer together than they 
are in F. We say that the transformation W 
is contractive: It always moves points 
closer together. 

Another example of a contractive af- 
fine transformation is shown in figure 7. 
This time it acts on a leaf to produce a 
new, smaller leaf. 

The general form for an affine trans- 
formation is 





Figure 5: [FS-encoded color image 
from A Cloud Study (30 
transformations, 500 bytes). 





Photographs © 1987 Georgia Tech Research Corp. Figure 1: Barnsley; Figures 2 and 3: Barnsiey, 
Jacquin, Malassenet, Reuter, Sloan; Figures 4 and 5: Barnsley, Jacquin, Reuter, Sloan. 
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where the coefficients a, b, c, d, e, and f 
are real numbers. 

If we know in advance the translations, 
‘rotations, and scalings that combine to 
produce W, we can generate coefficient 
values as follows: 


a=rcos@,b= —ssing, 
c=rsin@,d =scos @¢, 


where r is the scaling factor on x, s is the 
scaling factor on y, 8 is the angle of rota- 
tion on x, ¢ is the angle of rotation on y, e 
is the translation on x, and f is the transla- 
tion on y. 

How can you find an affine transfor- 
mation that produces a desired effect? 
Let’s show how to find the affine trans- 
formation that takes the big leaf to the lit- 
tle leaf in figure 7. We wish to find the 
numbers a, b, c, d, e, and f for which the 
transformation W has the property 


W big leaf) = little leaf. 


Begin by introducing x and y coordinate 
axes, as already shown in the figure. 
Mark three points on the big leaf (we’ve 
chosen the leaf tip, a side spike, and the 
point where the stem joins the leaf) and 
determine their coordinates (a,,a2), 
(81,82), and (1,72). Mark the corre- 
sponding points on the little leaf and 
determine their coordinates (a@1:,Q2), 
(81,62), and (71,72), respectively. 

Determine values for the coefficients 
a, b, and e by solving the three linear 
equations 


Table 1: IFS codes for a Sierpinski triangle. 


b 


0 
0 
0 
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Watobt+e=a,, (1) 
Biat+tfibt+te=f, (2) 
ymatybt+e=y71, (3) 
and find c, d, and f in similar fashion 
from these equations: 
ac+tad+f=a, (4) 
BictBd+f=h, (5) 
ncetydt+f=y. (6) 


We recommend the use of an equation 
solver such as TK Solver Plus (Universal 
Technical Systems, Rockford, Illinois) or 
Eureka (Borland International, Scotts 
Valley, California) for finding the coeffi- 
cient values. Doing it manually can be 

tedious. 
Now that we know what a contractive 
continued 





Figure 6: An affine transformation W 
moves the smiling face F to a new face 
W(F). The transformation is called 
contractive because it moves points 
closer together. 


oe 


Ch. 
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Figure 7: Two ivy leaves fix an affine transformation W. 


Table 3: IFS codes for a fern. 
WwW b c d 


0 0 0 0.16 
0.2 -0.26 0.23 0.22 
-0.15 028 026 0.24 
0.85 0.04 -0.04 0.85 


Table 4: IFS codes for fractal tree. 


WwW a b c d 


0 0 0 0.5 
0.1 0 0 0.1 
0.42 -0.42 0.42 0.42 
0.42 0.42 -042 0.42 
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affine transformation is and how to find 
one that maps a source image onto a de- 
sired target image, we can describe an 
iterated function system. An IFS is a col- 
lection of contractive affine transforma- 
tions. Here’s an example of an IFS of 
three transformations: 


wfz]-[98 8] [5] [3] 
wf? ]=[38 88] [z]+[8] 
mf] = [05 83] [3 ]+[ 3°] 


Each transformation must also have an 
associated probability, p,, determining its 
“importance” relative to the other trans- 








Figure 8: The result of applying the random iteration algorithm to the IFS code in 
table 1. It is called the Sierpinski triangle. 


Figure 9: A fern appears when the random iteration 





formations. In the present case we might 
have p1, P2, and p;. Notice that the proba- 
bilities must add up to 1. That is, p, + 
P2 a P3 = 1. 

Of course, the above notation for an 
IFS is cumbersome. Table 1 expresses 
the same information in tabular form. 
Other examples of IFS codes are given 
in tables 2 through 4. Notice that an IFS 
can contain any number of affine 
transformations. 


The Random Iteration Algorithm 
Now let’s see how to decode an arbitrary 
IFS code using the random iteration 
method. Remember that in general an 
IFS can contain any number, say m, of af- 
fine transformations, W,, W2, Ws,..., 
W,, each with an associated probability. 
The following code summarizes the 
method: 


Ltn ARG: 
¥. v. 


SOOO COs 
rN 
Cn > 


é ws 
52 Ws a2 ww 
Ly 4 Lx On © Ww. 


ows 









Renee OR: bs iss tps 2st, ee 


syns: sono 20: UIE wate: coe 


algorithm is applied to the IFS code in table 3. 


(i) Initialize: x=0, y=0. 

(ii) For n=1 to 2500, do steps (iii)-(vii). 

(iii) Choose k to be one of the numbers 1, 
2,...,m, with probability p,. 

(iv) Apply the transformation W, to the 
point (x,y) to obtain (x,y). 

(v) Set (x,y) equal to the new point: x=X, 


y=y: 
(vi) If n> 10, plot (x,y). 
(vii) Loop. 


Applying this procedure to the trans- 
formation in table 1 produces the figure 
shown in figure 8—a fractal known as the 
Sierpinski triangle. Increasing the num- 
ber of iterations n adds points to the 
image. Figure 9 shows the result of the 
random iteration algorithm applied to the 
data in table 3, at several stages during 
the process. By increasing the scale fac- 
tor used in plotting, you can zoom in on 
the image (see figure 10). The text box on 
page 221 contains a BASIC implementa- 
tion of the method with additional com- 
ments on programming. 

You may wonder why the first 10 
points are not plotted (step (vi)). This is 
to give the randomly dancing point time 
to settle down on the image. It is like a 
soccer ball thrown onto a field of expert 
players: Until someone gains control of 
the ball, its motion is unpredictable, or at 
least is independent of the players’ ac- 
tions. But eventually a player gets the 
ball, and its motion then becomes a direct 
result of the skill of the players. The fact 
that our transformation is contractive 
guarantees that the “ball” will eventually 
get to one of the “players,” and that it 
will stay under control after that. 

How do we know that the random iter- 

continued 
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aton algorithm will produce the same 
image over and over again, independent 
of the particular sequence of random 
choices that are made? This remarkable re- 
sult was first suggested by computer-graph- 
ical mathematics experiments and later 
given a rigorous theoretical foundation by 
Georgia Tech mathematician John Elton. 


The Collage Theorem 
Our next goal is to show a systematic 
method for finding the affine transforma- 
tions that will produce an IFS encoding of 
a desired image. This is achieved with the 
help of the Collage Theorem. 

To illustrate the method, we start from 
a picture of a filled-in square S in the x,y 


Figure 11: The collage theorem is used to encode a classical square S. The correct IFS code is obtained when the four affine 


plane, with its vertices at (0,0), (1,0), 
(1,1), and (0,1) (see figure 11). The ob- 
jective is to choose a set of contractive af- 
fine transformations, in this case W,, W2, 
W;, W,, so that S is approximated as well 
as possible by the union of the four sub- 
images W,(S)U W,(S)U W3(S) U W,(S). 
Figure 11 shows, on the left, S together 
with four noncovering affine transforma- 
tions of it; on the right, the affine trans- 
formations have been adjusted to make 
the union of the transformed images cover 
up the square. 

To find the coefficients of these trans- 
formations, we use the method described 
earlier in the section on iterated function 
systems, leading to simultaneous equa- 


































transformations of S cover S, as shown on the right. 


tions 1 through 3 and 4 through 6. The 
values one finds in the present case are 
given in table 2. When the random itera- 
tion algorithm is applied to this IFS code, 
the square is regenerated. 

The preceding example typifies the 
general situation: You need to find a set 
of affine transformations that shrink dis- 
tances and that cause the target image to 
be approximated by the union of the af- 
fine transformations of the image. The 
Collage Theorem says that the more ac- 
curately the image is described in this 
way, the more accurately the transforma- 
tions provide an IFS encoding of it. 

Figure 12 provides another illustration 
of the Collage Theorem. At the bottom 
left is shown a polygonalized leaf bound- 
ary, together with four affine transforma- 
tions of that boundary. The transformed 
leaves taken together do not form a very 
good approximation of the leaf; in conse- 
quence, the corresponding IFS image 
(bottom right), computed using the ran- 
dom iteration algorithm, does not look 
much like the original leaf image. How- 
ever, as the collage is made more accu- 
rate (upper left), the decoded image 
(upper right) becomes more accurate. 

So, there’s a fundamental stability 
here. You don’t have to get the IFS code 
exactly right in order to capture a good 
likeness of your original image. More- 
over, the IFS code is robust: Small per- 
turbations in the code will not result in 
unacceptable damage to the image. In 
each of the above examples, we have used 
four transformations to encode the image. 
However, any number can be used. 

For example, the spiral image in figure 
13 can be encoded with just two contrac- 
tive affine transformations. See if you can 
find them. Then determine the IFS trans- 
formation coefficients and input them to 
the random iteration algorithm to get the 
spiral back again. 
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Assigning Probabilities 

Once you have defined your transforma- 
tions, you need to assign probabilities to 
them. Different choices of probabilities 
do not in general lead to different images, 
but they do affect the rate at which vari- 
ous regions or attributes of the image are 


IFS Decoding 
in BASIC 


L isting A is a BASIC implementation 
of the random iteration algorithm. 
It includes the data for the Sierpinski tri- 
angle, but you can use it to process any 
IFS tables. In particular, you will want 
to try the data in tables 2, 3, and 4. Be 
sure to set the variable m correctly; it 
tells the program how many transforma- 
tions are in the IFS. 

It is also essential that the probabili- 
ties in p() add up to 1. For speed, the 
transformations should be listed in de- 
scending order of probability: the high- 
est probability transformation first, and 
the lowest probability last. 

The program includes variables for 
rescaling and translating the origin to 
accommodate the range of the points be- 
ing plotted to the limits of your screen. If 
the image is too wide, decrease xscale; 
if the points are too close horizontally, 
increase xscale. Adjust yscale simi- 
larly to get a good vertical point spread. 
To move the image, adjust xoffset and 
yoffset. 

You can do these adjustments by trial 
and error: Run the program; interrupt it 
and change the offsets and scale factors; 
and run it again. Or, you can replace the 
plot command pset with a command to 
print the values of x and y and run the 
program to get an exact idea of the range 
of points being plotted, so you can adjust 
the scale and offsets more precisely. 

Another way to arrange the program 
is to have it read all the data—m, a(), 
b(), e(), a(), e(), fC), pd), 
xscale, yscale, xoffset, and yoff- 
set—from a disk file specified by the 
user. Instead of reading in the coeffi- 
cients a, b, c, and d, you may want to 
read in angles 6 and ¢ and scale factors r 
and s, and then calculate the 
coefficients. 

The random iteration method is com- 
putation-intensive, so we recommend 
use of a compiler such as Microsoft’s 
QuickBASIC or Borland’s Turbo 
BASIC. If your computer has a floating- 
point coprocessor and your compiler 
supports one, so much the better. 





filled in. Let the affine transformations amount of time that the randomly dancit g 


W, corresponding to an image / be point should spend in the subimage W, is 
approximately equal to 
y C; d, y i ——_——ee 
area of J 
where i = 1, 2, 3,..., m. Then the continued 







































Listing A: A BASIC program demonstrating the use of the random 
iteration algorithm to reconstruct an IFS-compressed image. 


1@ ‘Allow for a maximum of 4 transformations in the IFS 
2@ DIM a(4), b(4), cf(4), d(4), e(4), £(4), pl4) 


48 ‘Transformation data, Sierpinski triangle 

5@ 'First comes the number of transformations 

68 ‘then the coefficients a through f and probability pk 
78 ‘The values for pk should be in descending order. 

88 DATA 3 

98 DATA .5,8,9, .5,8,8, .34 

188 DATA .5,9,8, .5,1,8, .33 

118 DATA .5,8,8,.5,.5,.5, .33 

‘Read in the data 
148 READ m 

158 pt = @ ‘Cumulative probability 

16@ FOR j = 1TOm 

178 READ a(j), b(j), c(j), dj), e(3), £3). pk 
188 pt = pt + pk 

198 p(j) = pt. 





288 NEXT jj 

219 ’ 

228 ’Set up for Graphics 

238 SCREEN 3 ‘Select graphics screen 


248 xscale = 358 ‘Map [8,1] onto [8,358] 
258 yscale = 325 *"Map [8,1] onto [8,325] 
268 xoffset = 8 


278 yoffset = 8 ’Leave the y~-origin 
288 ° 

298 ’Initialize x and y 

388 x = @ 

318 y = 8 


’ 


"Do 2588 iterations 
348 FOR n = 1 TO 2588 
358 pk = RND 





368 ‘The next line works for m<=4. It must be modified 

378 ‘for values of m> 4. 

388 IF pk <= p(1) THEN k = 1 ELSE IF pk <= p(2) THEN k = 2 
ELSE IF pk <= p(3) THEN k = 3 ELSE k = 4 

398 newx = a(k) * x + b(k) * y + e(k) 

488 newy = c(k) * x + d(k) * y + £(k) 

418 x = newx 

428 y = newy 

432 “Use PRINT x,y instead of the PSET line 

448 ‘to see the range of coordinates. Then fix 

458 ‘xscale, yscale, xoffset, and yoffset 

468 IF n > 18 THEN PSET (x * xscale + xoffset, y * yscale 
+ yoffset) 

478 NEXT n 


’ 


LOCATE 24, 35 

588 PRINT "Press any key to end."; 
518 WHILE INKEYS = ™" 

52@ WEND 















‘Return to text screen 
558 SCREEN 2 
END 





IMAGE COMPRESSION 











Figure 12: The Collage Theorem is applied to a leaf. The collage at lower left isn’t 
much good, so the corresponding IFS image, shown at lower right, is a poor 
approximation. But as the collage improves, upper left, so does the IFS image. 





r this spiral image? Only two 
transformations are needed. 








So long as ad — cd is not 0, it is a stan- 
dard calculus result that our ratio equals 
the determinant of the transformation ma- 
trix for W,. So a good choice for the prob- 
ability p, is 


a,d, — b,c; 
py |a,d,—b,c,| 


provided none of these numbers p, comes 
out to be 0. A 0 value should be replaced 
by a very small positive value, such as 
0.001, and the other probabilities corre- 
spondingly adjusted to keep the sum of all 
the probabilities equal to 1. 

We now summarize the compression 
and decompression process: An input 
image is broken up into segments through 
image-processing techniques. These 
image components are looked up in the 
IFS library using the Collage Theorem, 
and their IFS codes are recorded. When 
the image is to be reconstructed, the IFS 
codes are input to the random iteration al- 
gorithm. The accuracy of the recon- 
structed image depends only on the toler- 
ance setting used during the collage 
mapping stage. 


Applications 

For graphics applications, we use a more 
sophisticated procedure that allows full- 
color images to be encoded. Combina- 
torial searching algorithms can be used to 
automate the collage mapping stage. Fig- 
ures 2, 3, and 4 were obtained using IFS 
theory at compression ratios in excess of 
10,000 to 1. These images were based on 
photographs in recent issues of National 
Geographic. A full-sequence video ani- 
mation, A Cloud Study, was shown at 
SIGGRAPH ’87. This was encoded at a 
ratio exceeding 1,000,000 to 1 and can be 
transmitted in encoded form at video 
rates over ISDN lines SDN stands for 
integrated services digital network, a 
concept for integrated voice and data 
communications). A frame from the ani- 
mation is shown in figure 5. 

The IFS compression technique is 
computation-intensive in both the encod- 
ing and decoding phases. Computations 
for the color images were all carried out 
on Masscomp 5600 workstations (dual 
68020-based systems) with Aurora 
graphics. Complex color images require 
about 100 hours each to encode and 30 
minutes to decode on the Masscomp. 

For practical applications, you need 
custom hardware that can speed the en- 
coding and decoding process. An experi- 
mental prototype, the IFSIS (iterated 
function system-image synthesizer), de- 
codes at the rate of several frames per 
second. The IFSIS device was produced 
from a cooperative effort between GTRC, 
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DARPA, Atlantic Aerospace Electronics 

Corporation, and Iterated Systems, and it 

was demonstrated on October 5, 1987, at 

the third annual meeting of the Applied and 

Computational Mathematics Program of 

DARPA. It can be connected to a personal 

computer through a serial port; the per- 

sonal computer sends the IFS codes to the 

device, which responds by producing com- 
plex color images on a monitor. 

The IFSIS is a proof of concept for 
faster devices with higher resolution. 
Once the higher-performance IFSIS de- 
vices are combined with ISDN telecom- 
munication, full-color animation at video 
rates over phone lines will be a reality. 

Another area for future application of 
IFS encoding is automatic image analy- 
sis. What’s in a picture? Does it show a 
spotted sandpiper or a robin? The more 
complex the image or the more subtle the 
question, the harder it becomes for an al- 
gorithmic answer to be formulated. But 
here’s the point: Whatever the answer, it 
will proceed faster if stable, compressed 
images are used. The reason for this is 
that image-recognition problems involve 
combinatorial searching, and searching 
times increase factorially with the size of 
the image file. 

During the spring of 1987, Iterated 
Systems was incorporated to develop 
commercial applications of IFS image 
compression. It is exciting to see how an 
abstract field of mathematics research is 
leading to new technology with implica- 
tions ranging from commercial and in- 
dustrial work to personal computing. & 
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Chaotic Compression 


A new twist on fractal theory speeds 
complex image transmission to video rates 


By Michael F. Barnsley 
& Alan D. Sloan 


associated with computer 
graphics severely check the 
use of low-bandwidth communica- 
tions devices for image transfer. 
Multiple users can easily overload 
even the markedly higher bandwidth 
of local area networks. Potential 
applications go unrealized as the 
cost of transmitting graphics in- 
creases directly with improved per- 
formance and greater complexity. 
But algorithms based on fractal 
geometry can encode image data at 
high compression ratios—in excess of 
10,000 to one. It is possible to trans- 
mit such compressed image files at 
video rates (30 frames per second) 
over normal telephone lines. A new 
graphics device, the Iterated Func- 
tion System Image Synthesizer, 
makes it possible to decode these 
compressed files at several frames 
per second. The technology holds 
promise over other solutions to the 
problem of transmitting today’s 
sophisticated computer graphics. 


| he massive volumes of data 


A Perspective on Transmission 

A 1000-by-1000-resolution com- 
puter graphics image with eight 
bits of color information per pixel 
needs one megabyte of data for speci- 
fication. The transmission of this 
image over a 2400-bit-per-second 
line takes over 55 minutes. Com- 


Michael F. Barnsley and Alan D. Sloan are 
professors of mathematics at the Georgia 
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pounding the problems of cost and 
delay, extended transmission times 
expose data to degradation. 

A one-megabyte-per-second Ether- 
net line transmits the same image 
in one second, but two users who 
are each exchanging one minute of 
animation of such images consume 
an hour of time. Given these bur- 
dens, it’s easy to see why there is 
no multiple-user network for ani- 
mating high-resolution images at 
video rates. 

Transmission times are propor- 
tional to transmission bandwidths. 
At current rates, for example, it 


takes three hours at a total trans- 
mission cost of $42 (25 cents per 
minute) to send our sample image 
1000 miles on a 2400-bps toll line. 
The same image can be sent to the 


same address in 16 seconds at a cost 
of 51 cents on a 56,000-bps T1 line. 


Confounding Performance 
Paradoxically, continuing improve- 
ments in computer graphics hard- 
ware and software only compound 
the problem. Faster hardware in- 
creases the rate of graphics produc- 
tion and thus increases demand on 
transmission. The quest for realism 


Ten thousand-to-one compression ratios can be realized with a 
new method based on chaos theory, fractal geometry, and affine 
transformations. While transmission can approach or exceed video 
rates, the challenge now is to reconstruct images as rapidly. 
Below: “Sunflower Field’’ and “Arctic Wolf,”” images compressed at 
10,000:1 at Georgia Tech. 
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drives graphics software to produce 
images of increasing complexity. 
_ Greater image complexity means 
more information that must be trans- 
mitted for each image. 

Sophisticated printers also increase 
transmission requirements. A new 
color printer from Tektronix is a 
case in point. Producing an A-size, 
24-bit-deep color printout at 300 dpi 
can chew up over 9M of data. Even 
at the Integrated Services Digital 
Network bandwidth of 64,000 bps, 
it will take over 15 minutes to supply 
the necessary data to the printer. 
Data compression and reconstruction 
clearly offer the greatest potential 
for relief from the costs associated 
with even the fastest communica- 
tions technologies. 

Digital data compression consists 
of encoding and decoding phases. In 
the encoding phase, an input string 
of integer-based (I) bits is trans- 
formed into a coded string of com- 
plex number-based (C) bits. The 
ratio of I to C bits is the compres- 
sion ratio. In the decoding phase, 
the compressed string regenerates 
the original data. If the compres- 
sion ratio is two to one, then in the 
time it would take to transmit one 
uncompressed image, two compressed 
images can be transmitted. 

High-ratio image compression 
slashes transmission time and costs. 
For example, an uncompressed 
minute of 30 frames-per-second ani- 
mation for a 24-bit-deep, 1000-by-1000 
color image takes seven months to 
transmit over a 24-bps line at a cost 
of $75,000. Compressed at a 10,000-to- 
one ratio, the same minute of ani- 
mation takes 30 minutes to trans- 
mit at a cost of about $7.50. On a 
1.544 million-bps line, uncompressed 
transmission takes eight hours at 
a cost of $339; compressed at 10,000 
to one, it takes just three seconds 
at a cost of four cents. 

There are several major classes 
of image-compression techniques, 
including transform and predictive 
types. Transform techniques are 
destructive in the sense that the 
original image cannot be recon- 
structed exactly from the image 
code. Zoran Corp. (Santa Clara, CA) 
offers a chip-level image compres- 
sion processor that employs trans- 
form techniques. And the Inter- 
national Standards Organization 
and International Telegraph & 
Telephone consultative committee 





have been working on image com- 
pression standards. But the com- 
pression ratios being discussed are 
basically pretty low. 

For example, one exact compres- 
sion method—a type of method call- 
ing for exact reconstruction of an 
image—calls for a ratio of just two to 
one. There are several exact methods, 
but they aren’t particularly useful. 
In one experiment, a photo montage 
was run-length encoded, producing 
a 60-percent increase in data length. 
Unix provides a Huffman encoder 
which yields a compression ratio of 
10 to nine. A Ziv-Lempel compression 
algorithm in public domain yields 
a compression ratio of five to four. 


on a line between P(n) and Vk 
where V, = A, V, = B, and V, = C. 
Throw away the first 25 points and 
plot the rest. The result is indepen- 
dent of the initial point P(0) as well 
as the sequence of random numbers 
which were chosen in its generation. 
When suitably generalized, this 
process leads to full-color images 
like the sunflowers and Arctic wolf 
on the preceding page. Inspired by 
National Geographic photographs, 
Georgia Institute of Technology 
graduate students Laurie Reuter 
and Arnaud Jacquin encoded each 
of these images at a ratio exceed- 
ing 10,000 to one. Both images are 
decoded by a random iteration al- 
gorithm like the one described. 
The chaotic-compression technique 
is computation-intensive in both 
the encoding and decoding phases. 


An experimental device reconstructs 
compressed video images at the rale 
of about four frames per second. 


Such ratios are common to exact 
compression methods. 

General Electric has announced 
the Digital Video Interactive com- 
pression system developed at the 
David Sarnoff Research Center 
(“DVI Video/Graphics,” July, p.125). 
As the DVI compression ratio in- 
creases above 10 to one, the loss of 
high-frequency information becomes 
more apparent. But fine detail isn’t 
as critical for motion video as for 
still images, so the DVI can produce a 
credible video at a 100-to-one ratio. 

The new method described here 
has been used to encode complex 
graphics at exact compression ratios 
of 10,000 to one. It can be used with 
classical compression techniques to 
increase yields even further—up to 
one million to one in the case of “A 
Cloud Study,” shown at SIGGRAPH 
’°87. The method is based on chaotic 
dynamic systems theory. A simple 
example illustrates the procedure. 

Given a triangle with vertices A, 
B, and C, choose an initial point P(Q) 
at random in the triangle. An itera- 
tive procedure (affine transformation) 
transforms a point P(n) to P(n + 1) 
at the n“ step as follows. 

Choose a number k at random 
from among the numbers 1, 2, or 3. 
The point P(n + 1) is the midpoint 


An experimental prototype device 
introduced at the Meeting of the 
Applied and Computational Mathe- 
matics Program of the Defense Ad- 
vanced Research Projects Agency 
(DARPA) in October, the Iterated 
Function System Image Synthesizer 
(IFSIS) decoded video images at the 
rate of about four frames per second. 
Because of the extreme compression 
ratios, this device can be coupled 
loosely to any host, which will treat 
it as if it were a printer—only it pro- 
duces complex color images instead 
of text. 

Networking and high-resolution, 
real-time animated graphics can be 
realized simultaneously by means 
of IFSIS-type devices. The demon- 
stration at the DARPA meeting 
served as proof-of-concept for faster 
transmission devices with higher res- 
olution. Once the higher-performance 
IFSIS devices are combined with 
64,000-bps telecommunications, full- 
color animation transmission and 
reconstruction at video rates over 
phone lines will become a commer- 
cial reality. 

Research into rapid image decod- 
ing and reconstruction methods 
continues with the objective of the 
development of a marketable sys- 
tem within a year. CGW 
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one. There are several exact methods, 
but they aren’t particularly useful. 
In one experiment, a photo montage 
was run-length encoded, producing 
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An experimental device reconstructs 
compressed video images at the rate 
of about four frames per second. 
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General Electric has announced 
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